Subword unit based speech recognition in car environments
نویسندگان
چکیده
This paper presents results of speaker-independent speech recognition experiments concerning acoustic front-ends, models and their structures in car environments. The database comprises 350 speakers in 6 different cars. We investigate whole-word models, contextindependent phoneme models and context-dependent within-word phoneme models. We studied task-dependent (same vocabulary context in training and test) phoneme models and present first results on task-independent (broad context in training, i.e. phonetically rich material) scenarios. The latter allows flexible vocabulary definition for applications with dynamically changing command words or new applications avoiding an expensive data collection. Acoustic preprocessing is carried out with mel-cepstrum combined with spectral subtraction and SNR normalization. The task-dependentword error rates are well below 3% for both wholeword and phoneme models. The task-independent scenarios have to be worked on further.
منابع مشابه
Speech Recognition Using Demi-Syllable Neural Prediction Model
The Neural Prediction Model is the speech recognition model based on pattern prediction by multilayer perceptrons. Its effectiveness was confirmed by the speaker-independent digit recognition experiments. This paper presents an improvement in the model and its application to large vocabulary speech recognition, based on subword units. The improvement involves an introduction of "backward predic...
متن کاملWord recognition using hidden Markov models and neural associative memories
for his interest in this thesis and his valuable advice. I thank my mentor Dr. Friedhelm Schwenker for his reading and his helpful recommendations. My thanks go also to Dr. Muhamed Qubbati and David Bouchain for a critical reading and for their useful suggestions. I also have to thank the Graduate School, University of Ulm whose doctoral scholarship financed this thesis. Further thanks go to my...
متن کاملData driven subword unit modeling for speech recognition and its application to interactive reading tutors
This paper proposes a novel token-passing search architecture for supporting subword unit based speech recognition and a corresponding algorithm based on the well-known LZW text compression method to determine a vocabulary of subword units in an unsupervised manner. We compare our subword unit selection algorithm to an existing approach based on Minimum Description Length (MDL) modeling and als...
متن کاملAre Initial / Final Units Acoustically Accurate ?
| We show a comparative study of subword unit segmentation of Mandarin speech data. Most HMM recognition systems use intial//nals as subword units for Mandarin speech. We nd that such a division of monosylla-ble data into intial//nal units are not always supported by acoustic evidences. We implement a delta MFCC based seg-mentation method and compare its output with that of Viterbi segmentation...
متن کاملCombined Optimisation of Baseforms and Model Parameters in Speech Recognition Based on Acoustic Subword Units
A major challenge in speech recognition is creating a lexicon which is robust to inter-and intra-speaker variations. This is even more so in speech recognisers based on non-linguistic units, e.g., acoustic subword units (ASWUs), since no standard pronunciation dictionaries are available. Thus the baseforms describing the vocabulary words in terms of the recognition units need to be generated fr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998